Temperature dependence of circular DNA topological states 
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Circular double stranded DNA has different topological states which are defined 
by their linking numbers. Equilibrium distribution of linking numbers can be ob- 
tained by closing a linear DNA into a circle by ligase. Using Monte Carlo simulation, 
we predict the temperature dependence of the linking number distribution of small 
circular DNAs. Our predictions are based on flexible defect excitations resulted from 
local melting or unstacking of DNA base pairs. We found that the reduced bending 
rigidity alone can lead to measurable changes of the variance of linking number dis- 
tribution of short circular DNAs. If the defect is accompanied by local unwinding, 
the eff'ect becomes much more prominent. The predictions can be easily investigated 
in experiments, providing a new method to study the micromechanics of sharply bent 
DNAs and the thermal stability of speciflc DNA sequences. Furthermore, the pre- 
dictions are directly applicable to the studies of binding of DNA distorting proteins 
that can locally reduce DNA rigidity, form DNA kinks, or introduce local unwinding. 
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I. INTRODUCTION 

In a closed circular double stranded DNA molecule, the two strands are linked. The 
number of times the two strands wind around each other is a topological invariant, and it is 
called the linking number Lk, an integer that may be positive or negative depending on the 
orientation of the two strands. Due to the right-handed double helical structure of DNA, 
there is an intrinsic linking number Lkg = where is the number of base pairs (bp) 

of the DNA, and 7 = 10.5 is the number of base pairs per helical turn of DNA. It is more 
convenient to describe the topological states of the circular DNA using the linking number 
difference ALk = Lk — Lko. Since Lko is not an integer, ALk is not an integer either. 
However, ALk can only defer by integer numbers. Previous experiments and simulations 
show that, the equilibrium distributions of ALk for circular DNA in the range of 250 - 10,000 
bp may be approximated by Gaussian distributions. A thorough review of linking number 
distributions of circular DNA can be found in l(]. 



It is we^ 
of a DNA 



1 known that the linking number puts a constraint on the twist and the writhe 

ATw + Wr = ALk, (1) 



where ATw is the difference between the actual twist of the circular DNA and the relaxed 
twist of the same DNA of linear form; Wr is the writhe of the DNA backbone. Due to this 
constraint, the distribution of ALk may be obtained from convolution of the distributions 



of ATw and Wr 



p(ALk) = y"rfWrpt^(ALk-WrK,(Wr). (2) 

Please note that an assumption is made where /9t^,(ATw) and ptt,r(AWr) are independent, 
which is a reasonable approximation according to |5|. 

For a linear DNA of a contour length L, its twist energy in /c^T unit can be approximated 
by an harmonic form: i?(ATw) = 27r^(C/L)ATw^, where C = 75 nm is the canonical twist 

nnbn 

persistence length of a B-DNA pj, 16|, 17|, 18|. The harmonic form of the twist energy leads 
to a Gaussian distribution of the twist with a variance of af^ = On the other hand, 

the writhe distribution depends on the bending elasticity model of DNA. In a discretized 
DNA model, a DNA can be considered to be a chain consisting of straight segments of a 
segment length 6 <^ A, where A = 50 nm is the persistence length of linear B-form DNA. 
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The bending energy is carried by the vertices in a circular DNA: 

N 



E = ^ Ei(ii, tj+i), 



(3) 



i=l 



where Ei is the bending energy of the i^^ vertex connecting the two adjacent tangent vectors 
ii and U+i. 

In the worm-hke chain (WLC) model of DNA, Ei = ^(tj+i — UY, where the persistence 
length A is also the correlation length of local tangent vectors |8|, [ol, 
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ll| . Previously, 

Eq. ([2]) was used to obtain the linking number distributions based on the WLC model {4]. 
The predicted p(ALk) were found in good agreement with those measured in experiments 







ij for circular DNAs of 250 - 10,000 bp sized in a wide temperature range of 4 - 37 degree 
Celsius [12, Q. In addition, the WLC model has been supported by numerous other 
experiments including the force-extension measurements of single DNAs of micrometer(s) 



long 



flQ,[iil 



and the "j-factor" measurement to measure the looping probability of DNA 



of larger than 230 bp jsl- It has no doubt that for DNA larger than 200 bp, at temperature 
below 37 degree Celsius, the WLC model with choice of A in 45-53 nm is suitable to describe 
the DNA bending elasticity. 

However, the above experiments and simulations do not rule out the possibility that WLC 
may fail to describe DNA elasticity at sharper bending conditions (for example, small DNA 
loops), or/and at higher temperatures. The most likely defects are melted base pairs or base 
pairs that lose their stacking interactions with its neighboring base pairs. The possibility 
of excitation of such defect was first pointed out by Crick and Klug 15(1 , and recently was 
observed in molecular dynamics simulation 16|, where the defect is an unstacked base-pair 
step that allows formation of > 90 degrees of bending at the defect site. Furthermore, 



such defects were also observed in < 65 bp minicircles recently [17[. We note that both 
melting and unstacking are sensitive to temperature. Theoretically, it has been shown that 
rare excitations of flexible defects may have drastic effects on the looping efficiency of small 
DNA ^Q,^. Such sensitivity to excitations of flexible defects is expected to exist in 
various other aspects of micromechanics of sharply bent DNAs. In this research, the effects 
of the flexible defect excitation on p(ALk) of small circular DNAs are investigated based on 
an assumption that the defects have a reduced bending rigidity. The additional effects of 
possible reduced twist rigidity and unwinding of defect are also considered. 
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II. EFFECTS OF THE DEFECT FLEXIBILITY ON LINKING NUMBER 

DISTRIBUTION 

A defect may change various local mechanical properties, such as bending rigidity, twist 
rigidity, and may lead to unwinding of DNA. In this section, we show that the reduced 
bending rigidity alone is sufficient to cause measurable increase of variance of the linking 
number distribution for small circular DNAs. A DNA bending energy model that includes 
the excitations of flexible defects should contain three critical parameters: the persistence 
length of B-DNA A, the persistence length of the defect A' {A' <^ A), and the energy 
required to excite one such defect /i (in ksT unit). As shown in p^, the vertex energy can 
be straightforwardly written as: 

Ei = (ti+i - t,) + fl6n„l, (4) 

where the rii are two-state variables, indicating whether segment i is either in double helix 
form {ui = 0) or contains a defect {rii = 1); a = A/b and a' = A' /b are the vertex bending 
rigidity of the regular DNA site and the defected DNA site, respectively. In our simulation, 
A = 50 nm is the B-DNA persistence length. A' = 1 nm is close to the bending persistence 
of single stranded DNA, and fi is scanned in the range 4 - oo ksT. The segment length b is 
chosen to be 1 nm = 3 bp. This does not mean the defect size has to be 3 bp large. Instead, 
it means the defect may have a size from 1 bp up to 3 bp, so it includes the defect at single 
base pair step level as suggested in [isl, Q. As such. A' in our model is an average rigidity 
over all possible defects, and /i is the average excitation energy. 

In the above model, the Ising degree of freedom {ni,n2, ■ ■ ■ ,nN} is independent of the 
conformational degree of freedom {ii,i2, ■ ■ ■ ,iN}, so an effective vertex energy that only 
depends on the conformation may be derived by summation over the Ising degree of freedom, 
as shown in |2l|: 

E, = _/n[e-t(*>+i-*»)' + e-^-T(*-+i-*0']. (5) 

Since Eq. (|5]) only depends on conformation, it is more convenient for Monte Carlo (MC) 
simulation. Hence, it was used in our simulation for writhe distribution of circular DNAs. 
A vertex is determined to be defected when its bending angle exceeds a threshold bending 



angle 9c = arccos(l — ^7^) (please refer to Eq. (23) of section E.3 in |2l|). 



Based on Eq. ([5]), MC simulation is used to sample conformations of circular DNA. 
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To update the conformation, a subchain is rotated by a random angle around the straight 
hne connecting two randomly chosen vertices. Metropolis criterion is applied to determine 



whether the new conformation is accepted. During the simulation, the distribution o: 



number is obtained. For each simulated DNA circle, its writhe is computed by 



22 



writhe 



23|: 



where Cij is the unit vector pointing from the j^^ vertex to the i^^ vertex, and dij is the 
distance between the two vertices. Normalized histogram of the simulated writhe numbers 
being divided by the bin size gives the density distribution function p^r(Wr). The volume 
exclusion is included in the simulation in which conformations with dij < 3.4 nm are not 
accepted. 

Fig. [1] shows the average number of defects for circular DNA from 40 nm to 160 nm. 
For /i = 10 ksT, there are < 1 defects in circular DNA bigger than 80 nm, while there are 
about 2 defects in small circular DNA of 40 nm. This is consistent with the result obtained 
previously by transfer matrix calculation [21]. When fi decreases to 6 or even 4 ksT, the 
average number of defects begins to increase with increasing DNA length. At 4 ksT, about 
1 /3 of the DNA is melted. 

In this section, we only consider the effects of the reduced bending rigidity of the defects, 
so we set the twist rigidity of the defect to be unchanged: C = C = 75 nm. Therefore, 
the defect excitation does not affect the twist distribution: pt^(ATw) is a Gaussian with a 
variance: a^^ = -^^^^ The variance is ~ 0.01 for 40 nm DNA and ~ 0.05 for 160 nm DNA. 
The writhe distribution based on energy Eq. ([5]) was obtained by the normalized histogram 
of the recorded writhe numbers using a bin size of 0.01. The normalized frequency is then 
divided by the bin size to obtain the density distribution function p^r(Wr). Fig. [2](a-d) 
shows p^t,r(Wr) for DNAs of four different sizes that are subjected to various excitation 
energies. Clearly, the excitations in general lead to larger fluctuations of Wr. 

Once p^r(Wr) is obtained by simulation, p(ALk) is obtained by numerical convolutions of 
Ptor(Wr) and pf^(ATw). The results are summarized in Fig. [3](a-d). Obviously, the widths 
of the distributions are noticeably increased by the excitations of defects. The variances of 
the distributions of p(ALk) are summarized in Table [T] (the left number in each bracket). 
From the table, the variances can change by a few folds in the range of the excitation energy 
studied. Table I shows that for DNAs > 80 nm and at the excitation energy p > 10 /c^T, 
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the variances are very close to the predictions by the traditional WLC model. This is in 
agreement with the previous DNA cyclization experiments done at temperatures 20 - 30 
degrees Celsius js, [2J], and with the DNA linking number distribution measurements for 
DNAs larger than 200 bp done at temperatures 4-37 degrees Celsius Q,Q,Q]. For DNA 
of 40 nm, the variance at = 10 fc^T is twice of that predicted by WLC model. However, 
the variance is perhaps too small to be measured in experiment. As shown in jl2|, a linking 
number variance as small as 0.03 can be experimentally measured. As such, the effects of 
the defect excitation can be investigated for 80 - 160 nm DNAs at < 8 ksT. As shown 
in Table I, under this condition the variances of the linking numbers are apparently larger 
than that predicted by the WLC model. 



III. EFFECTS OF REDUCED TWIST RIGIDITY AND UNWINDING OF THE 

DEFECT 

In the previous section, we show that with excitation of flexible defects, the flexibility 
of the defects increases the variance of the writhe distribution, and hence the variance of 
the linking number distribution. The mean of the linking number distribution is still zero, 
therefore, the major information of the effects of the defect is only reflected by the changes in 
the variance of linking number distribution. The variance of ALk may be further increased 
if the defect is accompanied by reduced twist rigidity. It is straightforward to show that 
Pfu;(ATw) is still a Gaussian with a variance : 

1 f iN-n)b nb\ 

where n is the number of defects in the DNA, and C is the twist persistence length of the 
defect. In this section, we consider a reduction of the twist persistence length of C" = C/4. 

In addition to changes in the bending rigidity and twist rigidity, the defect may also lead 
to unwinding of DNA. Obviously, the underwinding may affect both the mean and variance 
of the linking number distribution. In B-form DNA, the relaxed twist angle per base pair 
step is about A defected segment of 1 nm (3 bp) is likely to destroy the local geometry 
constraint of the twist, leading to unwinding by an angle 0, hence a relaxed twist angle per 
segment is about — cj). The previous sections considered a limiting case where = 0. In 
this section, we consider the effects of > up to 
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Under these conditions, ptw and p^r are obviously no longer independent from each other 
since they both depend on the excitation of defect. Therefore, the linking number distribu- 
tion is given by equation 

p(ALk) = E^nP/fe(ALk|n) 

(8) 

= Y.Pnj rfWrpt^(ALk - Wr|n)p^,(Wr|n), 

n 

where Pn is the probability of the DNA to contain n defects, and p{x\n), where x refers to 
ALk or Wr or ATw = ALk — Wr, is the corresponding conditional distribution. 

Numerically the linking number distribution can be conveniently calculated directly from 
MC simulation by equation 

1 

p(ALk) = — ^p,^,„^(ALk - Wr,), (9) 

i=l 

where M is the total number of conformations sampled in the MC simulation. At each step 
of the simulation, the writhe number of the conformation is computed and substituted in the 
argument of ptw The expression of ptw,ni is a Gaussian function with a variance determined 
by Eq. (j7]) and average value —riicj), where is the number of defects excited in the i*^ 
conformation. 

We first look at the combined effects of {A' = 1 nm, C = C/A = 18.75 nm, and = 0) 
(Table [H right number in each bracket). It shows that the reduction in twist rigidity of 
C = C/A does not significantly increase the linking number variance: for p > 10 ksT, the 
variances are nearly the same as those of C" = C; For p < 10 fc^T, the variances are only 
slightly larger than those of C = C. This is not surprising: it simply means that, in the 
presence of a flexible defect, the major contribution to the changes in the linking number 
distribution is from the changes in writhe distribution. 

In contrast to the mild effect of C, we expect a dramatic effect of > 0. Firstly, it may 
change the mean of the linking number since unwinding leads to an offset of the relaxed twist 
angle per segment; secondly, it may further increase the linking number distribution since 
the averages of linking number are dependent on number of defects in the DNA, and the 
number of defects is not a constant. Fig. H] shows the linking number distribution of 40 and 
160 nm circular DNAs with excitation energy p = 4 and 8 fc^T. As expected, with increasing 
of unwinding angle, linking number drifts away from the original equilibrium value of zero. 
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and the distribution becomes wider. At the lower excitation energy = 4 fc^T, the average 
hnking number drifts to much lower values due to the larger number of excited defects. 

IV. DISCUSSIONS 

To relate the excitation energy ^ to temperature, the candidates of the possible defects 
should be discussed. Relevant to the DNA base pair stability, we consider two candidates: 
1) the unstacking between two adjacent DNA base pairs, and 2) the melting of one or a few 
DNA base pairs. It was found recently that the temperature sensitive stacking interaction 
is the main stabilizing factor in DNA double helix, while the A ■ T pairing is always unstable 
and the G ■ C pairing does not contribute to the stability. In addition, the base pairing 
was found independent of temperature 26]. However, this does not mean that melting does 
not cost more energy than unstacking. Instead, there is an additional large energy penalty 
for melting one base pair inside an DNA double helix. This energy penalty is related to a 
temperature and sequence insensitive "ring factor" ^ 10^'^, corresponding to an energy 
penalty ~ 6.6 ksT [27]. The real melting energy is roughly the unstacking energy plus this 
large energy penalty. 

On the other hand, the excitation energy yU in our model is defined for a given DNA con- 
formation, while in a bulk measurement all the possible conformations contribute. There- 
fore, to relate it to the unstacking or melting energy Ji measured in the bulk experiments, 
one needs to consider the conformational entropy gain resulted from the defect excitation. 
According to the excitation model Eq. (jlj), for an unconstraint DNA, the probability of a 
segment to be defected is p = (1 + ^^^e^^e^'^"^)^"'^, where iq is the zero*'^ order modified 
spherical Bessel function of the first kind [21]. On the other hand, for a two states lattice 

1 —2a' 

model it should be p = (1 -|- e^)~ . Direct comparison gives ft = fi — ^ In \~ g-2a (please 
not this relation can be derived by comparison between the chain Hamiltonian of the defect 
excitation model and that of the two state lattice model 25j). Using our parameters a = 50, 
and a' = 1, fl ^ fi — 3.7 ksT. Thus, the energies fi = (4, 6, 8, 10, oo) correspond to the bulk 
values fi = (0.3, 2.3, 4.3, 6.3, oo). 

As shown in [26], the energy contributions from stacking to DNA base pair stability are 
1.6 - 3.3 ksT for G ■ G base pair and 1.1 - 2.7 ksT for A ■ T base pair, respectively, in the 
temperature range 20 - 55 degrees Celsius. Hence, the melting energy of one base pair in 
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the temperature range is 8.2 - 9.9 ksT for G • C base pair and 4.4 - 9.3 ksT ior A-T base 
pair, respectively. Obviously, the melting energies measured in bulk experiments are larger 
than our excitation energies. Therefore, one should not expect to see much melting related 
changes in linking number distribution. However, according to the data, our excitation 
energies are sufficient to unstack neighboring base pairs. Although it is unclear whether 
the unstacking defect is a flexible defect, it is reasonable to think that it may lead to the 
formation of a "kink". Since a flexible defect in a small circular DNA is presented as a 
"kink" , the prediction based on flexible defect excitation should also describe the effects of 
kink excitation on the linking number distributions. In addition, unstacking a base pair step 
is likely associated with local unwinding, so the discussion of the eff'ect of unwinding may 
also be relevant. 

As shown in our computation, the excitation of defects can change the linking number 
distribution through three ways: 1) increase the variance of the writhe distribution by 
reduced local bending rigidity (equivalently, formation of local kink), therefore increase the 
linking number variance; 2) increase the variance of the twist distribution by reduced local 
twist rigidity, therefore increase the linking number variance, and 3) reduce the average 
and increase the variance of the twist by local unwinding, therefore drift and widen the 
distribution of linking number. At low excitation energies (i.e., high temperatures), 1) 
and 3) dominate. In addition to study DNA base pair stability, these predictions may be 
useful in other applications. For example, one can use it to study non-speciflc interactions 
between DNA and DNA-distorting proteins, where the protein binding can be treated as the 
excitation of "kink" or "unwinding'' defect. The change in the linking number distribution 
can then be related to the binding affinity, the bending angle, and unwinding angle induced 
by protein binding. 

In summary, we have predicted the temperature dependence of linking number distri- 
butions of small circular DNAs in the temperature range 20 - 55 degrees Celsius. This 
temperature range is convenient for experimental studies. For temperatures below 37 de- 
grees Celsius, one can use the T4 ligase to close DNAs into loops. For temperatures higher 
than 37 degrees Celsius, one can use Taq DNA ligase (it is known active within the range 
45 - 65 degrees Celsius) to close the DNAs into loops. Once the linear DNA is looped, its 
linking number is fixed, so the linking number distribution can be measured at any conve- 
nient temperatures. Although our predictions are based on excitations of flexible defects. 
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they should also describe the effects of excitations of "kinks" . The most likely candidate of 
the defect is the unstacking of one base pair step which is presumably a "kink" . For another 
candidate, the melting of one or a few DNA base pair, costs a much higher energy, therefore 
it is less likely to be excited. We expect to see increase in the variance of linking number 
distribution at high temperatures, which is likely associated with drift in the mean. We also 
emphasize an potential application for studies of DNA-protein interactions by looking at the 
change in linking number distribution due to protein binding. 

Finally, we note that our predictions of temperature dependence of linking number distri- 
bution of short DNAs should not be understood as accurate quantitative predictions. This 
is because the energy model Eq. ([5]) used in the simulation contains two parameters, so 
there are certain degrees of freedom to choose the parameters. The choice of the parameters 
{A',fi) should agree with the predictions by WLC model for DNA not sharply bent and at 
room temperature. From Tabled we see that for /i > 10 ksT, the linking number variances 
converge to those predicted by the WLC model. In addition, previously we showed that 
the choices of A' = 1 nm and /i = 11 agree with the force-extension curve measured 
in single-DNA stretching experiments, and the "j-factor" measurement of > 230 bp DNAs 
2l|. Furthermore, this set of parameters also explain the "j-factor" measurement of 



18|, 



100 bp DNA at 30 degrees Celsius reported in [2J, |28| (but note that another "j-factor" 



measurement of ~ 100 bp DNA done at a lower temperature 21 degrees Celsius agreed with 
the prediction by the traditional WLC model [3]). If one chooses a larger A', then fi be- 
comes correspondingly smaller in order to be in agreement with the experiments. However, 
our predictions of the temperature dependence of linking number distribution of short DNAs 
should still qualitatively hold. 
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FIG. 1: The average number of defects in circular DNA as a function of the excitation energy fi. 
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FIG. 2: Writhe number distributions based on Eq. ([5]) of DNA minicircles of sizes 40 nm (a), 80 
nm (b), 120 nm (c), and 160 nm (d), respectively. For each figure, from outside to inside, the 
excitation energies are 4, 6, 8, 10 ksT, and oo (B-DNA), respectively. 
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FIG. 3: Linking number distributions of DNA minicircles based on Eq. ([5]) of DNA minicircles of 
sizes 40 nm (a), 80 nm (b), 120 nm (c), and 160 nm (d), respectively. For each figure, from outside 
to inside, the excitation energies are 4, 6, 8, 10 /^bT, and oo, respectively. 



TABLE I: Variances of the linking number distributions of DNA minicircles of sizes 40 nm, 80 nm, 



120 nm, and 160 nm, which are 


subject to excitation energies (in 


ksT unit) 4, 6, 


8, 10, and oo. 


In each bracket. 


the left number 


is computed usinj 


r C' = C = 75 nm, while the rig 


ht number was 


computed using 


C' = C/4 = 18.75 nm. 








Size (nm) 


/i = 4 ksT 


fi = 6 ksT 


fj, = 8 ksT 


fi = W ksT 


oo 


40 


(0.07,0.08) 


(0.03,0.04) 


(0.02,0.03) 


(0.02,0.02) 


(0.01,0.01) 


80 


(0.25,0.26) 


(0.09,0.10) 


(0.05,0.05) 


(0.03,0.03) 


(0.03,0.03) 


120 


(0.39,0.47) 


(0.19,0.19) 


(0.07,0.07) 


(0.05,0.05) 


(0.05,0.05) 


160 


(0.55,0.68) 


(0.30,0.31) 


(0.10,0.11) 


(0.07,0.07) 


(0.07,0.07) 
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FIG. 4: Linking number distributions of DNA minicircles. (a) 40 nm, = 8 ksT; (b) 40 nm, 
/X = 4 ksT; (c) 160 nm, = 8 fcsT; (d) 160 nm, /it = 4 ksT. In each figure, from right to left, the 
unwinding angle per defect is ^ = 0, 26, 51, and 103 degrees. C = C/4 = 18.75 nm was used in 
the calculation. 



